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The Baker-Campbell Hausdorff series computes the quantity 

OO 

Z(X, V) = In (e x e y ) = z n (X, Y), 

71=1 

where X and Y are not necessarily commuting, in terms of homogeneous multinomials z n (X,Y) 
of degree n. (This is essentially equivalent to computing the so-called Goldberg coefficients.) The 
Baker-Campbell-Hausdorff series is a general purpose tool of wide applicability in mathematical 
physics, quantum physics, and many other fields. The Reinsch algorithm for the truncated series 
permits one to calculate up to some fixed order N by using ( N + 1) x ( N + 1) matrices. We 
show how to further simplify the Reinsch algorithm, making implementation (in principle) utterly 
straightforward. This helps provide a deeper understanding of the Goldberg coefficients and their 
properties. For instance we establish strict bounds (and some equalities) on the number of non-zero 
Goldberg coefficients. Unfortunately, we shall see that the number of terms in the multinomial 
z n (X,Y) often grows very rapidly (in fact exponentially) with the degree n. 

We also present some closely related results for the symmetric product 

OO 

S(X, Y) = In (e x/2 e y e x/2 ) = s n (X, Y). 

n=l 

Variations on these themes are straightforward. For instance, one can just as easily consider the 
series 

OO 

L(X, Y) = In (e x e Y e~ x e~ Y ^j = ^(X, Y). 

n= 1 

This type of series is of interest, for instance, when considering parallel transport around a closed 
curve. Several other related series are investigated. 

Keywords: Commutators, matrix exponentials, matrix logarithms, Baker-Campbell-Hausdorff 
formula. 
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I. INTRODUCTION 

What is now called the Baker-Campbell-Hausdorff formula (or series) has been studied for well over a century M- 
In the current article we will discuss the relatively recently developed Reinsch algorithm , and relations between 
this algorithm and Goldberg’s results from 1956 floi - lTl ]. One version of the Baker-Campbell Hausdorff formula is 
the expansion [Tol (T3 | 


Z{X, Y) = ln(e x e y ) = ^ g(w) w, (1.1) 

W 

in terms of the rational-number Goldberg coefficients g(w), and “words” w constructed from the “alphabet” {X, Y}. 
The homogeneous multinomials z n (X,Y ) of degree n are then constructed in terms of words of length n: 

z n (X,Y)= Y, 9 (w)w(X,Y). ( 1 . 2 ) 

\w\—n 

In fact what is now called the Baker-CampbelHlausdorff theorem is the result that the multinomials zJlX , Y) are 
in fact representable in terms of nested commutators (Lie brackets). Specifically, Dynkin’s expansion [4j, [5| amounts 
(with hindsight) to the observation that [mu 

z n (X,Y) = - V g(w) [tu(X,y)], (1.3) 

\w\—n 

where [ui(X, Y)] denotes the nested commutator built from the word w{X, Y). For definiteness, we consider right- 
nested commutators of the form [ABCDEF ... ] = [A, [B , [C, [D , [E, [F,... ]]]]]]. This is sometimes called the “long 
commutator”. Note that in view of anti-symmetry and the Jacobi identity many different words w(X,Y) can map to 
the same nested commutator [w(X, Y)]. Then 


Z(X,Y) =ln(e x e Y ) =^z n (X,Y), (1.4) 

n—1 

and the goal is to find efficient ways of calculating the g(w) and/or the z n (X, Y). Since (e^e^ ) 1 = e~ Y e~ x we have 
Z(X,Y) = —Z(—Y, —X). This implies the symmetries Z2 n (X, Y) = —Z2 n {X,X) and Z2 n +\{X 1 Y) = Z2 n +i(y, X). The 
first few terms can be represented as 

Z(X, Y) = \n(e x e Y ) = X + Y + i [X, Y] + ±[X - Y, [X, Y]] - ±[X, [Y, [X, Y]]] + ... (1.5) 

Unfortunately the expansion rapidly becomes extremely unwieldy. Specifically, we shall show that the limit superior 
of the number of terms in the Goldberg expansion grows exponentially in n. Thus even though explicit computer- 
aided computations can on a modern laptop easily be carried out to n = 13, (or sometimes higher if one focusses on 
specific questions), beyond n = 8 or thereabouts the resulting formulae are simply too cumbersome to be usefully 
written down on paper. (This style of approach complements what what can be extracted by considering special-case 
commutators, as in [13].) 


II. THE SIMPLIFIED ALGORITHM: LOW-ORDER TERMS — zi TO 24 


Our simplified version of Reinsch’s algorithm is this: Suppose one wishes to calculate up to some fixed word length 
N. Construct two (N + 1) x (N + 1) matrices that are zero except for the first super-diagonal where they contain N 
distinct elements. That is: 


■ 0 

Xi 

0 

0 

0 

0 ■ 



'0 

yi 

0 

0 

0 

0 ■ 

0 

0 


0 

0 

0 



0 

0 

V2 

0 

0 

0 

0 

0 

0 

x 3 

0 

0 

, and 

Yv = 

0 

0 

0 

2/3 

0 

0 

0 

0 

0 

0 

Xn-1 

0 



0 

0 

0 

0 

• • • UN-1 

0 

0 

0 

0 

0 

0 

x N 



0 

0 

0 

0 

0 

2 IN 

0 

0 

0 

0 

0 

0 



0 

0 

0 

0 

0 

0 


( 2 . 1 ) 
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Now compute (eg, using Maple or some equivalent package), the (N + 1) x (N + 1) matrix 

Z N = In (exp(Xjv) exp(Yjv)) ■ ( 2 . 2 ) 


The first row of the matrix Z n is (essentially) the information we want. Specifically, noting that the matrix Z jy is 
strictly upper triangular, let us denote 


0 

Zi 

22 

23 

• • • zn- 1 

zn " 

0 

0 

* 

* 

* 

* 

0 

0 

0 

* 

* 

* 

0 

0 

0 

0 

* 

* 

0 

0 

0 

0 

0 

* 

0 

0 

0 

0 

0 

0 


Here the *’s denote nonzero quantities that are not of specific interest for current purposes. 

Brute force computation, (see Appendix lAl for appropriate Maple code), yields as the first four terms: 


(2.3) 


zi=x 1 +y 1 ; (2.4) 

22 = ;j(*i 2/2 - 2 / 1 * 2 ); (2.5) 

23 = j2^ XlX2y3 ~ 2x ^ x 3 + *12/22/3 + 2/1 *2*3 - 2y 1 X 2 2/3 + 2/i2/2*3); (2.6) 

Z 4 = ^-(*1*22/32/4 - 2^12/2*32/4 + 22/1*22/3*4 - 2/12/2*3*4)- (2.7) 


Already at the stage n = 5 the terms are relatively turgid to explicitly write down. (Full explicit formulae for z 5 , Zg, 
Z 7 , and zs are presented below in Section uni) The corresponding z n (X,Y) multinomials are constructed by simply 
replacing Xi —> X and yi —> Y while preserving the order of the letters. That is, for the first four terms, 


z 1 (X,Y) = X + Y; (2.8) 

z 2 (X,Y) = ^(XY-YX)- (2.9) 

z 3 (X, Y) = j^(X 2 Y - 2XYX + XY 2 + YX 2 - 2 YXY + Y 2 X); (2.10) 

z 4 (X, Y) = —{X 2 Y 2 - 2 XYXY + 2YXYX - Y 2 X 2 ). (2.11) 


The original Reinsch algorithm Q involved an extra set of N sign symbols cq £ {+1, — l}^, and an additional “symbol 
conversion stage”, which the current algorithm avoids. 

The fundamental reason the algorithm works is that Xn and Ym have been carefully constructed to not commute 
with each other, and to be linearly independent of each other. Furthermore, for m £ {1,..., N} all of the {X^) m 
and (Y N ) m are non-zero only on the m’th super-diagonal, and so are all linearly independent of each other. Indeed 
(X^) N+1 = 0 = (Y n ) n+1 so all matrix functions (in particular the matrix exponential and matrix logarithm) have 
finite Taylor series expansions. 

Specifically, let us consider the closely related 00 x 00 matrices * and d i+ \ j y, where * and y need not 

commute. These matrices possess the property that if we have a homogeneous polynomial, P, of degree to, then 
P(5i+ij x, Si+i'j y) = 5i+ m j P(*, y). Consequently when we compute the Baker-Campbcll Hausdorff series for these 
specific matrices, then V* the (i, i + N) entry will be exactly all terms of degree N of the Baker-Campbcll Hausdorff 
expansion, for arbitrary variables *, y. In particular, now truncating to (N + 1) x (TV + 1) matrices, this truncation 
naturally eliminates all terms of order greater than N ; the other non-zero entries in the resulting matrix are the terms 
of degree less than N. 

The introduction of the subscripts Xi and yi to decorate the elements of these matrices is merely a way of getting 
around the software’s implicit assumption that the variables x and y commute. The elements of the first row of Zn, 
the z n = [Zjf\ yn-i-i, are successively built up from a sum of exactly n products of the X^ and Y N . Furthermore z n 
will contain a (respectively, a yi) if and only if the corresponding string of n matrices has a X^ (respectively, a 
Yv) in its ith position. 

This completes the description of the simplified Reinsch algorithm. 
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The next step, that of replacing words w by commutators [w], involves several elementary but tricky steps. For 
instance, from the definition of the long commutator [ABCDEF ...] = [A , [B, [C, [D , [E, [F, ... ]]]]]] we immediately 
have [wX a>1 ] = 0 = [wY b>1 ], More subtly \w\W 2 \ = [wi[ 102 ]], which leads to 

[wiXYw 2 \ = [wiXY[ w 2 ]\ = [wiX[Y, [w 2 ]]] = [wi[X, [Y, [iu 2 ]]]]. (2-12) 

Applying the Jacobi identity to this last term yields 

[w 1 XYw 2 \ = -[w![Y, [[w 2 ], A]]] - [wi[[w 2 ], [A',F]]] = [tor[Y, [X, [w 2 ]]]] + [tui[[X, Y], [to 2 ]]]. (2.13) 

That is 

[w\XYw 2 ] = [w\YXw 2 ] + [iui [X,Y}w 2 }- (2-14) 

This quite general result immediately implies, for instance, [YXXY] = [XYXY]. 

When computing the commutator version of z 3 , we encounter 

[X 2 Y - 2XYX + XY 2 + YX 2 - 2YXY + Y 2 X] = [X 2 Y - 2XYX - 2YXY + Y 2 X] 

= [X 2 Y + 2 X 2 Y + 2 Y 2 Y + Y 2 X] 

= 3[X 2 Y + Y 2 X] 

= 3 [{X-Y)XY]. (2.15) 

Similarly, when computing 24 one encounters 

[. X 2 Y 2 - 2XYXY + 2YXYX - Y 2 X 2 } = [—2 XYXY + 2YXYX] = -4 [XYXY]. (2.16) 

This now yields 

zi{X,Y)=X + Y- (2.17) 

z 2 (X,Y) = ±[XY}; (2.18) 

z 3 (X,Y) = ±[{X-Y)XY\; (2.19) 

z 4 (X,Y) = ±\XYXY\. (2.20) 

( 2 . 21 ) 

The word versions of z 5 , Zq , Z 7 , and z%, and the commutator versions of z 3 and zq, are presented below. 


III. THE SIMPLIFIED ALGORITHM: SOME MEDIUM-LOW ORDER TERMS — z 5 TO 2 S 

In this Section we reproduce explicit results for Z5, ze, Z7, and z%. Beyond this stage the formulae are simply too 
cumbersome to be usefully written down on paper. The only scientific justification for explicitly presenting even this 
level of detail is that it explicitly demonstrates the patterns and symmetries of the Goldberg coefficients g(w) in a 
somewhat non-trivial context. 


A. The Z 5 term 

For z 5 we have: 


yiyzx 3 y 4 x 3 


xiy 2 x 3 y 4 x 5 

, yix 2 y 3 x A y 3 


2/1*22/3*4*5 

2/1*22/32/4*5 

2/1*2*32/4*5 

120 


30 

+ 30 


120 


120 

120 

2/12/22/3Z/4Z5 


£12/22/32/4*5 

£1*2 2/32/4*5 

L 

*1*2*32/4*5 


2/1^22/32/42/5 

2/1*2*32/42/5 

720 


180 

120 


180 


180 

120 

yix 2 x 3 X4,y 5 


yix 2 x 3 x 4 x 5 

2/12/2*32/42/5 


2/12/2*3*42/5 

L 

2/12/2*3*4*5 

*12/2*32/42/5 

180 


720 

120 


120 


180 

120 

x\y 2 x 3 x 4 y 3 


xiy 2 x 3 x 4 x 5 

2/12/22/3*42/5 


2/12/22/3*4*5 


*12/22/3*42/5 

*12/22/3*4*5 

120 


180 

180 


180 


120 

120 

Xix 2 y 3 x 4 y 5 


x 4 x 2 y 3 x 4 x 3 

^12/22/32/42/5 


*1*22/32/42/5 

L 

*1*2*32/42/5 

xix 2 x 3 x 4 y 5 

120 


120 

720 


180 


180 

720 


( 3 . 1 ) 
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Converting Xi —> X, and yi —> Y so as to get a representation in terms of words in the {X, X} alphabet, we obtain: 


Y 2 XYX | XYXYX | YXYXY YXYX 2 YXY 2 X YX 2 YX 
120 + 30 + 30 120 120 120 

y 4 x i xy 3 x x 2 y 2 x i x 3 xx i yjy 3 xx 2 x 2 

720~ + 180 120 + 180 + 180 120 

i xx 3 x xx 4 y 2 xy 2 x 2 x 2 x t x 2 x 3 xyxy 2 

' 180 720 120 120 + 180 120 

xxx 2 x i xxx 3 i y 3 xy i x 3 x 2 xy 2 xy xx 2 x 2 
120 + 180 + 180 + "T80 120 120 

X 2 YXY X 2 YX 2 XX 4 | X 2 X 3 | X 3 X 2 X 4 X 
120 120 720~ + 180 + ~T80 720~‘ 


(3.2) 


Converting to right-nested commutators to get a Dynkin representation is tedious (due to antisymmetries and the 
Jacobi identity). Blanes and Casas @ report the equivalent of 


z 5 (X, X) = i (-[X 4 X] + 6[XXXXX] + 2[XX 3 X] + 2[XX 3 X] + 6[XXXXX] - [X 4 X]) . (3.3) 


B. The ze term 


For zq we have: 

2/1^22/3^42/5^6 

z 6 — ~ 


2 / 12 / 22 / 3 ^ 42 / 52:6 . yiy2X 3 y 4 x 5 x 6 


xiy 2 x 3 y 4 x 5 y 6 


60 


360 

+ 

240 

+ 

240 

+ 

240 

+ 

60 

yix 2 y 3 X 4 :X 5 xe 


yiX 2 y 3 yiX 5 x 6 


yix 2 x 3 y 4 x 5 x 6 


y\X 2 y 3 y 4 y 3 XQ 


yix 2 x 3 y 4 y 5 xe 


yix 2 x 3 x 4 y 5 x 6 

360 


240 


240 


360 


240 


360 

2 : 12 / 22 / 32 : 42 / 52/6 


xiy 2 y 3 XAX 3 y& 


x\x 2 y 3 x 4 y 3 y G 


xix 2 y 3 x 4 x 5 y 6 


yiy2y 3 y 4 x 5 x e 


2 : 1 2/22/32/42’52/e 

240 


240 


240 


240 


1440 


360 

2 : 12 : 22 / 32 / 42 : 52/6 

+ 

Xix 2 x 3 y 4 x 5 y 6 

+ 

yiy 2 x 3 x 4 x 5 x 6 

+ 

xiy 2 x 3 y 4 y 5 y 6 

_ 

x\y 2 x 3 x 4 y 5 y & 

+ 

Xiy 2 X 3 x 4 x 5 y 6 


240 


360 


2/12/22/32:42:52:6 , xix 2 x 3 y 4 y 3 y G xix 2 x 3 x 4 y 5 y 6 x^y^^ye 


360 

Converting Xi —> X and y,; 


360 1440 1440 ' 

X, so as to get a representation in terms of words in the {X, X} alphabet, we obtain: 


ze(X,Y) 


YXYXYX Y 3 XYX | X 2 XXX 2 i Y 2 XY 2 X i X 2 X 2 XX i XXXXXX 
60 360 + 240 + 240 + 240 + 60 

XXXX 3 | XXX 2 X 2 | XX 2 XX 2 XXX 3 X | XX 2 X 2 X XX 3 XX 
360 + 240 + 240 360 + 240 360 

XX 2 XX 2 XX 2 X 2 X X 2 XXX 2 X 2 XX 2 X | X 4 X 2 ( XX 3 XX 
240 240 240 240 + 1440 + 360 

X 2 X 2 XX X 3 XXX X 2 X 4 XXXX 3 XXX 2 X 2 XXX 3 X 


240 360 1440 360 240 360 

X 3 X 3 X 3 X 3 X 4 X 2 X 2 X 4 

360 + 360 1440 1440 ' 


(3.5) 


Converting to right-nested commutators to get a Dynkin representation is tedious (due to antisymmetries and the 
Jacobi identity). Blanes and Casas @ report the equivalent of 


s 6 (X,X) = (—2[X 2 X 2 XX] + 6[XXXXXX] - [XX 4 X] + [XX 4 X]) . 


(3.6) 
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C. The z 7 term 


For z 7 we have: 



xiy 2 X 3 y 4 X 5 y 6 x 7 


2/12/2232/4251/627 


2/1x22/32/4252/6X7 


2/122x32/4x52/6X7 

2/1222/3242/5262/7 

z 7 

140 


840 


840 


840 

140 

+ 

2/1222/3X42/5X6X7 

+ 

2/1222/3242/52/627 

+ 

2/1222/3X4252/6X7 


2122X32/4252/6X7 

2/12/22/32/4252/627 

840 

840 

840 


630 

2016 


£12/22/32/4252/62+ 

L 

2i2 2 2/32/42 5 2/627 

L 

212/22/3242/5262/7 


212/22/3242/52627 

212/22/3242/52/627 


630 


840 


840 


840 

840 


2l2/22/3242 5 t/627 

4- 

21X22/3242/5262/7 

4- 

21 222/3 242/5 2 6 27 

4 

21222/3242/52/627 

21222/324252/627 

840 


840 


840 


840 


840 


2/12/22/3242/5262/7 


2/i 2/2 2/3 242/5X6 27 


2/12/22/3242/52/627 


2/12/22/324X52/6X7 

212/2X32/4X52/62/7 


630 


5040 


5040 


5040 

840 

+ 

2i2/22 3 2/42 5 262/7 


212/2X32/4252627 

+ 

212/2X32/42/5X62/7 

+ 

212/2232/42/52627 

212/223X42/5X62/7 

840 


630 

840 

840 

840 

+ 

2l2/22 3 X42/52627 


212/2232/42/52/627 

l 

2i2/22 3 2 4 2/52/627 


212/2X324X52/627 

2/12/2X32/4X52/62/7 

840 


630 


840 


630 

840 


2/l2/22 3 2/42 5 2 6 2/7 


2/i 2/223 2/4X5 2 6 2 7 


2/12/2232/42/5X62/7 


2/12/2232/42/5X6X7 

2/12/223X42/5X62/7 


840 


5040 


840 


1120 

840 


2/12/223242/52627 


2/12/2232/42/52/627 


2/12/223242/52/627 


2/12/22324252/627 

2/1X22/3X425X62/7 


1120 


5040 


1120 


5040 

630 

+ 

2/l222/32 4 2 5 2 6 27 

_L 

2/1222/32/4252/62/7 

l 

2/1222/32/425262/7 


2/1222/32/425X627 

2/1X2X32/4X52/62/7 

2016 


840 


840 


5040 

840 


2/122232/425262/7 


2/122231/425X627 


2/1222/32/42/5262/7 


2/1222/32/42/52627 

2/122232/42/5X62/7 


840 


5040 


630 


5040 

840 


2/122232/42/52627 


1/122X3242/5262/7 


1/1x223242/52627 

4 

2/1222/32/42/52/627 

2/122232/42/52/627 


1120 


630 


5040 


2016 

5040 


2/lX 2 2 3 2 4 2/52/627 


2/1X22324252/6X7 


2/1222/3242/52/62/7 


2/1222/324252/62/7 

212/22/32/42/52/627 


5040 


2016 


630 


840 

5040 


21222/32/42/52/627 


21 x 2 23 2/42/52/6 27 


21x2X3x42/52/6X7 


212223X4252/6X7 

212/22/32/4252627 


2016 


1512 


2016 


5040 

1512 


2i2 2 2/32/42 5 2/62/7 


21222/32/4X5X6 2/7 


21222/32/425X6X7 


Xix 2 x 3 y 4 x 5 y 6 y 7 

21X2X32/4X5262/7 


1120 


1120 


5040 


5040 

5040 


2l22232/42 5 2 6 27 


2/12/22/32/42/5262/7 


2/12/22/32/42/52627 

+ 

212/22/32/42/5X62/7 

212/22/32/42/5X6X7 


1512 


5040 


5040 

2016 

2016 


212 2 2/32/42/52 6 2/7 


21X22/32/42/52627 


21X2 232/42/5262/7 


2122232/42/52627 

2l 22X3242/5262/7 

2016 


5040 


5040 


5040 


5040 

L 

21X223X41/52627 

_L 

2/12/22/32/42/52/627 


212/2232425X627 


2/12/22/3242/52/62/7 

2/12/22/3X4X52/62/7 


2016 


30240 


5040 


1512 

5040 


2/i 2/22/3242 5 2 6 2/7 


2/12/22/32425X6X7 


212/22/3X42/52/62/7 


212/22/324252/62/7 

212/22/32425262/7 


1512 


3780 


5040 


1120 

5040 


212/22/32 4 2 5 2 6 2 7 


21X22/3X41/52/62/7 


21X22/3X4252/62/7 


21X22/32425X62/7 

21222/324X5X6X7 


2016 


5040 


1120 


5040 

2016 

+ 

2/12/22/32/4252/62/7 


2/12/22/32/425X62/7 ! 

2016 

2/12/22/32/425X6X7 


212/22/32/4252/62/7 

2l 2/22/32/425262/7 

2016 


3780 


5040 

5040 


2/1222/32/42/52/62/7 


2/l2 2 2 3 2/42/52/62/7 


2/122X3X42/52/62/7 

j_ . 

2/1X22324X52/62/7 

2/1222324X5X62/7 


5040 


2016 


1512 


2016 

5040 


2/1X22324X526X7 

+ 

2/12/2232/42/52/62/7 


2/12/223X42/52/62/7 


2/12/22324252/62/7 

2/l2/2232 4 2 5 2 6 2/7 


30240 

2016 


5040 


5040 

2016 


2/12/22324X526X7 


212/2X32/4 2/5 2/62/7 


212/2X3X42/5 2/6 2/7 


212/2X324X52/62/7 

2i2/22 3 242 5 262/7 


5040 


2016 


5040 


5040 

2016 

L 

212/2 2/32/42/52/6 2/7 


XlX 2 2/3 2/42/5 2/6 2/7 ; 

5040 

2122232/42/52/62/7 


21X2X3242/52/62/7 

2i 2 2 2 3 2 4 252/62/7 


30240 


3780 


3780 

5040 


XiX 2 X 3 X4X 3 XQy 7 

30240 


(3.7) 


There are 63 occurrences of each 2 , above, and 63 occurrences of each y-i. (Note that 63+63 = 126 = number of terms.) 
































































































































Converting Xi —> X and yi —>• Y so as to get a representation in terms of words in the { X , Y } alphabet we obtain: 


XYXYXYX Y 2 XYXYX YXY 2 XYX YX 2 YXYX YXYXYXY 
140 1 840 1 840 1 840 140 

YXYXYX 2 YXYXY 2 X YXYX 2 YX X 3 YXYX Y'XYX 
+ 840 + 840 + 840 630 + 2016 

xy 3 xyx x 2 y 2 xyx xy 2 xyxy xy 2 xyx 2 xy 2 xy 2 x 

630 1 840 1 840 4 840 1 840 

xy 2 x 2 yx x 2 yxyxy x 2 yxyx 2 x 2 yxy 2 x x 2 yx 2 yx 

4 840 1 840 4 840 4 840 4 840 

y 3 xyxy y 3 xyx 2 y 3 xy 2 x y 3 x 2 yx xyxyxy 2 

630 5040 5040 5040 + 840 

x yxyx 2 y xyxyx 3 xyxy 2 xy xyxy 2 x 2 xyx 2 yxy 

+ 840 630 + 840 + 840 + 840 

xyx 2 yx 2 xyxy 3 x xyx 2 y 2 x xyx 3 yx y 2 xyxy 2 

+ 840 630 + 840 630 + 840 

y 2 xyx 2 y y 2 xyx 3 y 2 xy 2 xy y 2 xy 2 x 2 y 2 x 2 yxy 

4 840 5040 1 840 U20 1 840 

y 2 x 2 yx 2 y 2 xy 3 x y 2 x 2 y 2 x y 2 x 3 yx yxyx 3 y 

1120 5040 1120 5040 630 

YXYX 4 YXY 2 XY 2 YXY 2 X 2 Y YXY 2 X 3 YX 2 YXY 2 
4 2016 4 840 1 840 5040 1 840 

YX 2 YX 2 Y YX 2 YX 3 YXY 3 XY YXY 3 X 2 YX 2 Y 2 XY 
+ 840 “ 5040 ~~ 630 ~~ 5040 + 840 

YX 2 Y 2 X 2 yx 3 yxy YX 3 YX 2 YXY'X yx 2 y 3 x 

1120 630 5040 1 M 5040 

yx 3 y 2 x yx 4 yx yxyxy 3 yxyx 2 y 2 xy b x 

5040 + 2016 630 + 840 5040 

x 2 y 4 x x 3 y 3 x x 4 y 2 x x 5 yx xy 3 x 3 

4 2016 1512 + 2016 5040 1512 

X 2 V 2 XV 2 X 2 Y 2 X 2 Y x 2 y 2 x 3 x 3 yxy 2 x 3 yx 2 y 

1120 ~~ 1120 ~~ 5040 ~~ 5040 ~~ 5040 

x 3 yx 3 y 5 xy y 5 x 2 xy 4 xy xy'x 2 

1512 5040 5040 4 2016 4 2016 

X 2 Y 3 X Y X 2 Y 3 X 2 X 3 Y 2 XY X 3 Y 2 X 2 X 4 YXY 

5040 5040 5040 5040 1 2016 

X 4 YX 2 Y e X XYX 5 Y 3 XY 3 Y 3 X 2 Y 2 

4 2016 + 30240 5040 1512 5040 

Y 3 X 3 Y y 3 x 4 xy 2 xy 3 xy 2 x 2 y 2 xy 2 x 3 y 

1512 + 3780 5040 1120 5040 

xy 2 x 4 x 2 yxy 3 x 2 yx 2 y 2 x 2 yx 3 y x 2 yx 4 

4 2016 5040 1120 5040 4 2016 

Y 4 XY 2 y 4 x 2 y y 4 x 3 xy 3 xy 2 xy 3 x 2 y 

+ 2016 + 2016 + 3780 ~~ 5040 ~~ 5040 

YXY 5 YX 2 Y 4 YX 3 Y 3 YX'Y 2 YX 5 Y 
5040 + 2016 1512 + 2016 5040 

YX 6 Y 2 XY 4 Y 2 X 2 Y 3 Y 2 X 3 Y 2 Y 2 X 4 Y 

+ 30240 + 2016 5040 5040 4 2016 

Y 2 X 5 | X'YXY 4 XYX 2 Y 3 XYX 3 Y 2 | XYX 4 Y 

5040 4 20l6 5040 5040 1 2016 

XY 6 X 2 Y 5 X 3 Y 4 X 4 Y 3 X 5 Y 2 
+ 30240 ~~ 5040 + 3780 + 3780 5040 

X 6 Y 
+ 30240' 


( 3 . 8 ) 


There are 2 occurrences of X 6 above, 5 occurrences of X 5 , 12 occurrences of X 4 , 28 occurrences of X 3 , 64 occurrences 
of X 2 , and 144 occurrences of X 1 . Similarly for Y. (Note 63 x7 = 2x6 + 5x5 + 12x4 + 28 x3 + 68x2 + 144 x 1.) 
Converting this to commutators in the Dynkin form is somewhat impractical without significant computer-aided 
computation. See for instance reference Q, and related online tables Q. 
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D. The zs term 

For zg we have: 

yiy2y^XAygXQy 7 xg x 1 y 2 x 3 y4X 3 y 8 x 7 y 8 yiy 2 x 3 y4ybX & y 7 x 8 yiy2Xgx A ygx e y 7 xg yiy 2 x 3 y4x 5 y 6 x 7 x 8 


1260 


280 


1680 


1680 

1680 

yiy2X 3 yiX b y 3 y 7 xg 


yiy2x 3 y4x 3 x & y 7 xg 


y\x 2 y 3 y4x 3 y 8 x 7 x 8 


2/13222/32/43752/62/7328 

yix 2 y 3 y4x 5 x 6 y 7 x 8 

1680 


1680 


1680 


1680 

1680 

yix 2 y 3 X4X 3 yQX 7 xg 


y\x 2 y 3 x4y 3 ygy 7 xg 


yix 2 y 3 X 4 X 5 y 6 y 7 x 8 

L 

yix 2 y 3 X 4 X 5 x 6 y 7 x 8 

yix 2 y 3 X4y 3 x & x 7 x 8 

1680 


1260 


1680 


1260 

1260 

yix 2 y 3 X4y 3 yex 7 xg 


y\x 2 y 3 y4y 3 x 8 y 7 xg 


Vix 2 x 3 y4y 3 x & y 7 x 8 


y 1 x 2 x 3 X 4 y 3 x 8 y 7 x 8 

yix 2 x 3 y 4 X 5 y 6 x 7 x 8 

1680 


1260 


1680 


1260 

1680 

yix 2 x 3 y4X 3 y Q y 7 xg 


yix 2 x 3 y4x 5 x 6 y 7 x 8 


2/12/22/32/42/5^62/7^8 


XiX 2 y 3 y 4 X 5 y 6 x 7 y 8 

XiX 2 x 3 y4X 5 y 6 x 7 y 8 

1680 


1680 


10080 


1680 

1260 

yiy2y 3 yAX 3 yQX 7 xg 


yiy2y 3 y4x 5 y e y 7 x 8 


yiy2y 3 y4x 5 x 6 y 7 x 8 


32i2/22/32/43252/ 6 0272/ 8 

XiX 2 y 3 X 4 X 5 y 6 x 7 y 8 

4032 


4032 


4032 


1260 

1680 

x\y 2 y 3 X4y 3 y&x 7 yg 


xiy 2 y 3 X 4 X 5 y 6 x 7 y 8 

U 

xix 2 y 3 X4y 3 x 3 y 7 y 8 


xix 2 y 3 X4y 3 x 6 x 7 y 8 

x\x 2 y 3 X 4 y 3 y 8 x 7 y 8 

1680 


1680 


1680 


1680 

1680 

yiy2y 3 x4y 3 XQX 7 xg 


yiy2y 3 x4y 3 ye,x 7 xg 


yiy2y3X4x 5 yex 7 x 8 


32l2/22/332 4 2/532 6 2/72/8 

Xiy 2 y 3 X4y 5 XQX 7 y 8 

3024 


10080 


10080 


1680 

1680 

yiy2y 3 X4y 3 y 3 y 7 xg 


yiy2y3X4X 5 yey 7 x 8 

L 

yiy2y3X4x 5 x 6 y 7 x 8 


32i2/232 3 2/432 5 2/62/72/8 

xiy 2 x 3 y 4 X 5 x 6 y 7 y 8 

3024 


10080 


3024 


1260 

1680 

xiy 2 x 3 y4x 3 x e x 7 yg 


Xiy 2 x 3 y4y 5 x 6 y 7 y 8 


xiy 2 x 3 y 4 y 5 x 6 x 7 y 8 


xiy 2 x3X4ybX 8 y 7 y 8 

Xiy 2 x 3 X4y 5 x 6 x 7 y 8 

1260 


1680 


1680 


1680 

1680 

xiy 2 x 3 y4y 3 ygx 7 yg 


x\y 2 x 3 X 4 y 3 y&x 7 y 8 


Xiy 2 x 3 X 4 X 5 y e x 7 y 8 

_L 

2/l2/23232/42/532 6 32702 8 

yiy2x 3 x4y 3 x 8 x 7 x 8 

1260 


1680 


1260 


10080 

10080 

yiy2x 3 y4y 3 y&x 7 xg 

L 

yiy2x 3 x4y 3 yex 7 x 8 

L 

yiy 2 x 3 X 4 X 5 y 6 x 7 x 8 


2/12/23232/42/52/62/7328 

2/l2/232 3 a242/52/62/732 8 

10080 


2240 


10080 


4032 

10080 

yiy2X 3 X4X 5 y 6 y 7 x 8 


yiy2X 3 X4X 5 x 6 y 7 x 8 


yiy2x 3 y4x 5 x e x 7 x 8 


yix 2 y 3 x4x 5 x 6 x 7 x 8 

y\x 2 y 3 y4X 3 x 6 x 7 x 8 

10080 


4032 


4032 


10080 

4032 

yix 2 x 3 y4,x 3 XQX 7 xg 


yix 2 y 3 y4y 3 xex 7 x 8 


yix 2 x 3 y 4 y 3 x 8 x 7 x 8 

yix 2 x 3 X 4 X 5 y 6 y 7 x 8 yix 2 x 3 X 4 X 5 x 6 y 7 x 8 

4032 


3024 


10080 


4032 10080 

yix 2 x 3 x 4 y 5 x e x 7 x s 


yix 2 y 3 y4y 3 y6x 7 x 8 


yix 2 x 3 y 4 y 5 y 6 x 7 x 8 


yix 2 x 3 x 4 y 5 y 6 x 7 x 8 

y 1 x 2 x 3 x 4 x 5 y 6 x 7 x 8 


' — 


+ 


+ 



3024 


4032 


10080 


10080 

4032 

yix 2 y 3 y4y 3 y 3 y 7 xg 


yix 2 x 3 y4y 3 y 8 y 7 x 8 


y\x 2 x 3 X4y 3 yey 7 x 8 


Xix 2 y 3 X 4 X 5 y 6 y 7 y 8 

Xix 2 y 3 X 4 X 5 x e y 7 y 8 

10080 


4032 


3024 


10080 

10080 

XiX 2 y 3 y 4 X 5 y 6 y 7 y 8 


xix 2 x 3 y 4 x 5 x e y 7 y 8 


x\x 2 y 3 y 4 y 3 x 6 y 7 y 8 


xix 2 y 3 y4y 3 x 6 x 7 y 8 

xix 2 x 3 y4y 3 x 8 y 7 y 8 

10080 


10080 


10080 


10080 

10080 

xix 2 x 3 y4y 3 xe,x 7 yg 


XiX 2 y 3 y 4 X 5 x 6 x 7 y 8 

l_ 

X\X 2 x 3 X4y 3 y 8 x 7 y 8 


x 1 x 2 x 3 x 4 x 5 y 6 x 7 y 8 x 1 x 2 x 3 X 4 y 3 x 8 y 7 y 8 

10080 


10080 


4032 


10080 4032 

xix 2 x 3 x 4 y 5 x e x 7 y s 


2/12/22/32/42/52/6^7^8 


3212/22/32/42/52/6^72/8 


32i 3222/32/42/52/63272/8 

xix 2 x 3 y 4 y5y6X 7 y 8 

4032 


60480 


10080 


4032 

3024 

23 yiy 2 y 3 y4X 3 XQX 7 x 8 

•^■12/2 2/32/4 *7-5 2/62/72/8 

xiy 2 y3y4X5x 6 x 7 y 8 

xix 2 y 3 y4X 5 x 6 y 7 y 8 

xix 2 x 3 y 4 x 5 y 6 y 7 y 8 

120960 


3024 


3024 


2240 

3024 

xix 2 x 3 y 4 x 5 x 6 x 7 y s 


2/12/22/32/42/5^62:7^8 

-L 

3212/2 2/3 2/42/5 32 6 2/7 2/8 


32i2/22/32/42/532 6 3272/s 

yiy2y3X4x 5 x 6 x 7 x 8 

3024 


10080 


4032 


4032 

10080 

x\y 2 yzX4y 3 y 3 y 7 yg 


xiy 2 y 3 X 4 X 5 x e x 7 y 8 


xix 2 y 3 X4y5yey 7 y& 


xix 2 y 3 X4X 5 x 6 x 7 y 8 

yiy 2 X3X4X 3 x & x 7 x 8 

4032 


4032 


4032 


4032 

60480 

xiy 2 x 3 y4y 3 y&y 7 y?, 

L 

xiy 2 x 3 X4y 3 y 8 y 7 y 8 


xiy 2 x 3 X 4 X 5 y 6 y 7 y 8 

1— 

xiy 2 x 3 X 4 X 5 x e y 7 y 8 

xiy 2 x 3 x 4 x 5 x 6 x 7 y 8 

10080 


4032 


3024 


4032 

10080 

xiy 2 y 3 y 4 X 5 x e y 7 y s 


xiy 2 y 3 X4X 3 y 8 y 7 y 8 


xiy 2 y3X4X 3 x 8 y 7 y 8 


321 *22/32/42/52/62/72/8 

xix 2 x 3 y4y 3 y 8 y 7 y 8 

10080 


10080 


10080 


60480 

10080 


23 xix 2 xgXjy b y 6 y 7 yg _ xix 2 x 3 X4X 5 y 6 y 7 y 8 xix 2 x 3 X4X 5 x e y 7 y 8 y\x 2 y 3 X4y 3 x 6 y 7 x 8 

120960 10080 60480 280 


There are 62 occurrences of each Xi above, and 62 occurrences of each y.-, . (Note that 62+62 = 124 = number of terms.) 
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Converting Xi —> X and yi Y so as to get a representation in terms of words in the {X, Y } alphabet we obtain: 


y 3 xyxyx xyxyxyxy y 2 xy 2 xyx y 2 x 2 yxyx y 2 xyxyx 2 

1260 ~~ 280 ~~ 1680 ~~ 1680 ~~ 1680 
y 2 xyxy 2 x y 2 xyx 2 yx yxy 2 xyx 2 yxy 2 xy 2 x yxy 2 x 2 yx 

1680 1680 1680 1680 1680 

yxyx 2 yx 2 yxyxy 3 X YXYX 2 Y 2 X yxyx 3 yx yxyxyx 3 

1680 1 1260 1680 1 1260 1 1260 

yxyxy 2 x 2 i yxy 3 xyx yx 2 y 2 xyx i yx 3 yxyx yx 2 yxyx 2 

1680 1 1260 1680 1 1260 1680 

yx 2 yxy 2 x yx 2 yx 2 yx i y 5 xyx t x 2 y 2 xyxy x 3 yxyxy 
1680 1680 + 10080 + 1680 1260 

y 4 xyx 2 y 4 xy 2 x y 4 x 2 yx xy 3 xyxy x 2 yx 2 yxy 

4032 4032 4032 1260 1 1680 

xy 2 xy 2 xy xy 2 x 2 yxy x 2 yxyxy 2 x 2 yxyx 2 y x 2 yxy 2 xy 

4 1680 1 1680 1 1680 1 1680 1 1680 
y 3 xyx 3 y 3 xy 2 x 2 y 3 x 2 yx 2 xy 2 xyxy 2 xy 2 xyx 2 y 

+ 3024 + 10080 + 10080 + 1680 + 1680 

y 3 xy 3 x y 3 x 2 y 2 x y 3 x 3 yx xyxyxy 3 xyxyx 2 y 2 

4 3024 1 10080 1 3024 1260 1 1680 

xyxyx 3 y xyxy 2 xy 2 xyxy 2 x 2 y xyx 2 yxy 2 xyx 2 yx 2 y 

1260 + 1680 + 1680 + 1680 + 1680 

xyxy 3 xy xyx 2 y 2 xy xyx 3 yxy y 2 xy 2 x 3 y 2 x 2 yx 3 
1260 1 1680 1260 h 10080 h 10080 

y 2 xy 3 x 2 y 2 x 2 y 2 x 2 y 2 x 3 yx 2 y 2 xy 4 x y 2 x 2 y 3 x 

4 10080 h 2240 1 10080 4032 h 10080 

Y 2 X 3 Y 2 X Y 2 X 4 YX Y 2 XYX 4 YXYX 5 YXY 2 X 4 
4 10080 4032 4032 4 10080 4032 

YX 2 YX 4 yxy 3 x 3 yx 2 y 2 x 3 yx 4 y 2 x yx 5 yx 

4032 1 3024 1 10080 4032 4 10080 

YX 3 Y:X 3 YXY 4 X 2 YX 2 Y 3 X 2 YX 3 Y 2 X 2 YX 4 YX 2 
4 3024 4032 1 10080 1 10080 4032 

YXY 5 X yx 2 y 4 x yx 3 y 3 x x 2 yx 2 y 3 x 2 yx 3 y 2 

4 10080 4032 1 3024 10080 10080 

x 2 y 2 xy 3 x 3 yx 2 y 2 x 2 y 3 xy 2 x 2 y 3 x 2 y x 3 y 2 xy 2 

10080 10080 10080 10080 10080 

X 3 Y 2 X 2 Y X 2 Y 2 X 3 Y X 4 Y 2 XY X-'YXY x'yxy 2 
10080 10080 + 4032 10080 + 4032 

X 4 YX 2 Y y 6 x 2 xy 5 xy x 2 y 4 xy x 3 y 3 xy 

4 4032 60480 10080 4 4032 3024 

23Y 4 X 4 XY 3 XY 3 XY 3 X 3 Y X 2 Y 2 X 2 Y 2 X 3 YXY 3 

120960 3024 3024 2240 3024 

X 3 YX 3 Y Y s X 3 xy 4 xy 2 xy 4 x 2 y y 3 x 5 

3024 4 10080 4 4032 1 4032 + 10080 

XY 2 XY 4 xy 2 x 4 y x 2 yxy 4 x 2 yx 4 y y 2 x 6 

4 4032 4 4032 1 4032 4 4032 60480 

xyxy 5 xyx 2 y 4 xyx 3 y 3 xyx 4 y 2 xyx b y 

10080 4 4032 3024 1 4032 10080 

XY 3 X 2 Y 2 xy 2 x 2 y 3 xy 2 x 3 y 2 x 2 y 6 x 3 y 5 
10080 10080 10080 + 60480 ~ 10080 

23 X 4 Y 4 X s Y 3 X 6 Y 2 YXYXYXYX 
+ 120960 ~~ 10080 + 60480 + 280 


There are 2 occurrences of X 6 above, 6 occurrences of X 5 , 14 occurrences of X 4 , 32 occurrences of X 3 , 72 occurrences 
of X 2 , and 158 occurrences of X 1 . Similarly for Y. (Note 62 x8 = 2x6 + 6x5 + 14x4 + 32 x3 + 72 x2 + 158 x 1.) 
Converting this to commutators in the Dynkin form is somewhat impractical without significant computer-aided 
computation. See for instance reference Q, and related online tables (§}. 
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IV. SOME PROPERTIES OF THE GOLDBERG COEFFICIENTS 

The Goldberg coefficients satisfy a number of interesting properties. (See for instance references [l0l - [T2| . and some 
new results presented below.) These properties can easily be checked against the explicit terms presented above. 
One reason for going out to zg is that it is actually slightly shorter (by 2 terms) than Z 7 , while zg is much longer 
(slightly over 3 times longer). Another reason for going out to zg, and no further, is that Zg contains the first Goldberg 
coefficient that is intrinsically rational, g(X 4 Y 4 ) = g{Y 4 X 4 ) = °ther Goldberg coefficients out to 8 th order 

are either zero or reciprocals of integers. 

Some key analytic results are: 

• For n > 2 the sum over words of fixed length is zero: 

E a{w) = 0- (4.i) 

\w\=n 


This easily follows from Z(X,X) = 2X, since the quantity X^| U j|=nfl , (' u; ) is the coefficient of X n in Z(X,X), 
and this coefficient is known to be zero. That this sum is zero is a necessary condition for the z n (X,Y) to be 
representable in terms of nested commutators. 

• Let |to|x = n x denote the number of letters X in the word w, and similarly for |iu|y = ny. Then we have 
\w\ = n = nx + ny- For n > 2 the sum over words of fixed nx and ny is zero: 

E 9 H = 0. (4.2) 

M X=nx\ |H Y=UY 


This easily follows from Z(sX,tX) = (s + t)X, since the quantity ^2\ w i x = nx . \wW=n Y ff( w ) 4S coefficient of 
s nxtfiYx n in Z(sX, tX), and this coefficient is known to be zero. In particular this implies g(X n ) = 0 = g(Y n ), 
though there are many other ways of convincing oneself of this. 

• Let Li denote either of the letters {X, Y}, so that an arbitrary word w of length n can be represented as 

w(L i ,m i ) = L? 1 L?*...L?<, (4.3) 

with Li Lj_|_i and > 1, while n = y~)?_i wii > q. Then [lOj 

g(w(Li, mi)) = g(w(Li, m w(i) )), (4.4) 

where n(i) is an arbitrary permutation of t g {1,2,..., q}. For example g(X a Y b ) = g(X b Y a ), while 
g{X a Y b X c ) = g(X b Y c X a ) = g(X c Y a Y b ) = g{X c Y b X a ) = g(X b Y a X c ) = g(X a Y c X b ). Identical results 
hold under the interchange X Y. Fundamentally, this symmetry is why so many of the Goldberg coefficients 
are repeated multiple times. (See Sections II and IIIII above!. 

• Let Li denote either of the letters { X , Y} : so that an arbitrary word of length n is L 1 L 2 • ■ • L n . (We now allow 
Li and L i+ 1 to possibly be equal.) Define the cyclic shift operator C by 

C(L 1 L 2 ...L n ) = L 2 ...L n L 1 . (4.5) 

Then acting on words of length n > 2 we have [l2| 

n 

E g(C m (L 1 L 2 ...L n ))=0. (4.6) 

m =0 

In particular this implies g{X n ) = 0 = g(Y n ), though there are many other ways of convincing oneself of this. 

• Define the interchange L{w) of the word w by interchanging all X -o- Y. Then because Z(X, Y) = —Z(—Y, —X) 
we have 


g(? M) = (—i) H+1 sM- 


(4.7) 
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• Define the reverse lZ(w) of the word w by 

TZ(LiL 2 ■ ■ ■ L n _iL n ) = L n L n _i... L 2 L \. (4-8) 

Then 

g(K(l(w))) = g(w). (4.9) 

That is, reversing the word and interchanging X -n- Y leaves the Goldberg coefficient invariant. This can be 
proved by inspection of the explicit formula for g(w ) appearing in Section IVl below. See particularly equation 
(15.711 and note that 

n (X(X ri Y Sl X r2 Y S2 ... X rk Y Sk )) = J x: Sk Y rk X Sk - 1 Y rk ~ 1 ... X Sl Y ri . (4.10) 

Once summed over the r,; and s,; the claimed result follows. In particular this implies 

g(Tl(w)) = (-l) |u,|+ 1 p(u)); and g(X(w)) = g(K(w)). (4.11) 

Even more specifically: Even length palindromes have Goldberg coefficient zero. For example g(X a Y 2b X a ) = 0. 
More generally, concatenating any word w with its reverse 1Z(w) one has g(w7t(w)) = 0. 

• Combining Goldberg’s permutation result with the reversal operator we see that 

g{LTLT ■ • ■ ££*') = 9{LT q L 2 q ~' • ■ • L?) = rni g( L 7 1 L™- 1 ■ • ■ (4.12) 

Thus whenever q is odd, (so that = Li), and |u>| = n = rrii is even, then the Goldberg coefficient must 
vanish 0 - For example g(XY 2 X) = g(X 2 YX) = g(XYX 2 ) = 0. More generally, for arbitrary non-negative 
integers a, b, c, we have 

g/j£-2a+ly26+2 j£-2c+l\ _ ^j£-2a-t-2y26+l j^2c+l^ _ ^^2a+ly2b-|-l j^2c+2^ — Q 

Identical results hold under the interchange X -o- Y. Fundamentally, this symmetry is why so many of the 
Goldberg coefficients for even length words are zero. 

Conversely, if we demand g(w) ^ 0 for a word of even length |u;| = 2 n then a necessary condition, (not a 
sufficient condition), is that it must be possible to write w = L™ 1 L™ 2 ... L^ q with L,; Li + \ and rrii > 1 and 

M = 2 n = 

• One of the few explicitly known evaluations of the Goldberg coefficients is 0 

g{X a Y b ) = g{X b Y a ) = (-l) a+b+1 g(Y a X b ) = (- l) a+b+1 g(Y b X a ) = ^ (4.14) 

2=1 ' ' 

where B a+ b_i are the usual Bernoulli numbers, in the usual convention where Bi = —1/2. In particular 

g(X a Y) = g(XY a ) = (-1 ) a g(Y a X) = {-l) a g{YX a ) = B a , (4.15) 

a! 

Since in this convention all other odd Bernoulli numbers are zero, for m > 1 we have 

g(X 2m+1 Y) = g(XY 2m+1 ) = g{Y 2m+1 X) = g(YX 2m+1 ) = 0, (4.16) 

while 

g(X 2m Y) = g(XY 2m ) = g(Y 2m X) = g(YX 2m ) = ± 0 . (4.17) 

( 2 m)! 

V. THE NUMBER OF NON-ZERO GOLDBERG COEFFICIENTS 

Let us denote by ff n the number of non-zero Goldberg coefficients on words of length n. That is 

#n = #{w : g{w) 7 ^ 0 and \w\ = n} . (5.1) 

What can we say about these numbers? 
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A. Bounds on 

Since there are exactly 2" words of length n on an alphabet of two letters, we certainly have < 2 n . But 
we can actually do better than that. For n > 2 the words X n and Y n never contribute to the Goldberg series, 
g{X n ) = 0 = g{Y n ), consequently for n > 2 we have < 2 ra — 2. Inspection of Table I quickly leads to the 
observation that this inequality is actually saturated whenever n is a prime number. 


TABLE I. Number of non-zero Goldberg coefficients 


n 


2" - 2 

#n/(2" - 2) 

#n/(2 n - 2) 

2 

2 

2 

1 

1.0000 

3 

6 

6 

1 

1.0000 

4 

4 

14 

2/7 

0.2857 

5 

30 

30 

1 

1.0000 

6 

28 

62 

14/31 

0.4516 

7 

126 

126 

1 

1.0000 

8 

124 

254 

62/127 

0.4882 

9 

390 

510 

13/17 

0.7647 

10 

388 

1022 

194/511 

0.3796 

11 

2046 

2046 

1 

1.0000 

12 

2044 

4094 

1022/2047 

0.4993 

13 

8190 

8190 

1 

1.0000 

14 

8188 

16382 

4094/8191 

0.4998 

15 

29776 

32766 

4961/5461 

0.9804 

16 

30124 

65534 

15062/32767 

0.4597 

17 

131070 

131070 

1 

1.0000 


To formally prove this we proceed as follows. We start with the standard result 


Then 


Z(X,Y) = ln(e x e r )=J2 

k> 0 


(-I)*" 1 

k 


E 

r i +s i >0; 

l<i<fc 


J^riysi X r2 Y S2 • • • X rk Y Sk 
ri!si!r 2 !s 2 ! • • -r k \s k \ 


z n (X, Y) 


E 

k> 0 


k 


E 

*■< + «<>(>; Ei(^i+si) = n 
l<i<fc 


X Vl Y Sl X r2 Y S2 • • • X rk Y Sk 
ri!si!r 2 !s 2 ! • • -r k \s k \ 


(5.2) 


(5.3) 


Unfortunately the “basis” used above is over-complete. For example XY 2 X = XYX°YX, etc. That is, many 
different X ri Y sl X r2 Y S2 ■ ■ ■ X rk Y Sk can contribute to the same word w(X,Y). To deal with this let us define 

W (X ri Y Sl X r 2 Y S2 ■ ■ ■X rk Y Sk ) = x Pl Y ai X P 2 Y a2 ... X PK Y (TK . (5.4) 


Here we “collapse” the symbol string X ri Y Sl X r2 Y S2 ... X rk Y Sk to generate a “word”. We do this by eliminating any 
interior zeros, and then merging adjacent indices that correspond to the same letter. At worst the only remaining 
zero indices are p\ and/or (Jk- That is, by construction Y\f =2 Pi ^ 0 ^ nf=i 1 a i an< i K < k < n. 

The Goldberg coefficients are then 


with 


d(w) = Y 

k> 0 


(-I)*" 1 

k 


E 

ri + Si > 0; 

l<i<fe 


S(w 


W(X ri Y Sl X V2 Y S2 ... X rk Y Sk )) 
ri!si!r 2 !s 2 !.. .r k ls k l 


K k 

ki = E(^ + *) = E( r ‘ + Si )- 

j =1 *=1 


(5.5) 


(5.6) 
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Here <5(u>i = w 2 ) ins the “indicator function” which equals 1 when w\ = w 2 and equals 0 when w\ ^ in 2 . 
In the sum over k we never get higher than k = |ty|, nor lower than k = K, so in fact 


g(w) = 


M 

E 

k=K 


(-1) 


k -1 


i+si>0; Ei(>"i+3i) = |u)| 


8 (w = W(X ri Y Sl X r 2 Y S2 ... X rk Y Sk )) 
ri!si!r 2 !s 2 !. ..r k \s k \ 


(5.7) 


(Proof: Obvious. Observe that |w| = ]T-(r, + s i) — 1 = k> K.) 

Let us now assume that |u>| = p is a prime. In the sum for g(w ), separate out the contribution with the highest 
value of k, (explicitly this is k = |u;| = p). Then 


g(w : \w\ =p) = 


-IIP - 1 


(- 1 ) 


E 


i + s i >0; X)i( r i+ s i)=P 
1 <i<p 


6 {w = W{X ri Y Sl X r 2 Y S2 ... X r *Y s p)) 
ri!si!r 2 !s 2 !. ,.r p \s p \ 


P-1 

+ E 

k—K 


(-1) 


k -1 


E 


8 (w = W(X ri Y Sl X r 2 Y S2 ...X rk Y Sk )) 


i +s i >0; Ei( r i+ s i)=P 
l<i<fc 


ri!si!r 2 !s 2 !... r k \s k \ 


(5.8) 


But in that first term, V* € {1,2, ...,p} we have Vi + s* = 1 so that {r^Si} = {1,0} or {0,1}; so the factorials 
trivialize. For fixed w the remaining sum in that first term has only one contribution. That is 


g(w : |w| =p) 


(— 1 ) p ~ l 

p 


p-i 

+ E 


k=K 


k 


E 

r i + s i>°; Ei(^i+Si)=P 
l<i<fc 


S(w 


W{X ri Y Sl X r 2 Y s2 ... X rk Y Sk )) 
ri!si!r 2 !s 2 !.. .r fc !s fc ! 


(5.9) 


Now suppose that X p ^ w ^ Y p , then in the trailing terms we have 1 < k < p, while ri < p and Si < p. So none of 
the trailing terms can individually have a factor p in the denominator. Thus, when the trailing terms are all summed 
and reduced to lowest rational form, there cannot be a factor p in the denominator. So there is nothing available to 
cancel the 1/p in the leading term. That is, for any word of prime length g{w : |w| = p) ^ 0, modulo the exceptional 
cases X p /m / Y p . In those two exceptional cases w = X p or w = Y p we already know g(w) = 0. That is, whenever 
|u;| = p is a prime number, we have /f p = 2 P — 2; the number of non-zero Goldberg coefficients is maximal. 

Consequently limsup(^ ra /2 ra ) = 1 and the number of terms in the Goldberg expansion grows exponentially with 
the word length. It is this observation that makes it absolutely clear that obtaining truly extensive tables of Goldberg 
coefficients is an intrinsically hopeless task. 


B. Bounds on # 2 n 

Furthermore, we note that whenever n = p + 1 is one more than an odd prime, then observationally it seems that 
we have # p +i = 2 P — 4. Let us first establish the bound that # 2rl < 2 2n_1 — 4 for n> 2. 

Consider an arbitrary word W 2 n-i of length 2 n — 1, and write it in the form rt> 2ra _ 1 = L™ 1 !™ 2 ... L™ 5 . Then two 
words of length 2 n are obtained by appending either a X orah: 

W2n-.X = W2n-lX = L ™ 1 L ™ 2 . . . L™«X' W 2 n,Y = w 2n -iX = L ™ 1 L ™ 2 . . . L™ q Y. (5.10) 

Now either g{u> 2 n-.x) = 0 or g(w 2 n:Y ) = 0. (If L\ = X, then g(w 2 n:x) = 0; if L\ = Y, then <?(u> 2n: y) = 0.) Since 
there are 2 2n_1 arbitrary words u; 2n _i, then at least 2 2 ™ -1 of the r/(u> 2n ) are zero. Additionally, for n > 2 we also 
know the four special cases g(X 2 n ~ 1 Y ) = g(Y 2 n ~ 1 X) = j(TF 2n_1 ) = g(FX 2 " -1 ) = 0, so at least 2 2n_1 + 4 of the 
g{w 2 n) are zero. Since there are 2 2n words of length 2 n, we see that at most 2 2 " -1 — 4 of them are non-zero: That 
is, # 2 n < 2 2n_1 — 4 for n > 2. Observationally it seems that this bound is saturated when 2n = p + 1 for p an odd 
prime. 

To formally prove that this bound actually saturates for 2n = p+1 consider words of the form w = L™ 1 L™ 2 ... L^ q , 
with |u>| =2 n = which we have already seen are the only possibilities for |iy| even, and write the relevant 

Goldberg coefficients as 


2n 1 

k>q 


E 

1 - 4 +« i >0 ! E i (r i +S i )=2n 
l<i<fc 


8 {L^L ™ 2 ... = W(X ri Y Sl X r 2 Y S2 ... X rk Y Sk )) 

ri!si!r 2 !s 2 !.. .r k \s k \ 


(5.11) 
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Specialize to 2n = p + 1, and separate out the top two terms 


g{LTLT...LZ^) = 


(~l) p 

p +1 


E 


S^L ™ 2 ... L™ 2q = W(X rq Y s 'X r2 Y S2 ... X^+ 1 F^+ 1 )) 


i + s i >0; Ei(r i + s i )=p+l 
l<i<p+l 


n!si!r 2 !s 2 !. ..r p+ i\s p+ i\ 


-IIP- 1 


+ 


(- 1 ) 


E 


^(L^L^ 2 ... Z4g = l^(X ri E Sl X r2 E S2 ... X r rY s r)) 


i + Si> 0; Ei(»’i + Sj)=p+1 
l<i<p 


ri!si!r 2 !5 2 !.. -r p \s p \ 


(- 1)^-1 

^ Zr* 


E 


S^L^L™ 2 ... L™ 2q = IT(X ri T Sl X r2 F S2 ... X rk Y Sk )) 


ri!si!r 2 !s 2 !.. .rk'-Sk'- 


k>q 


i + Si> 0; Ei(^i+Si)=P+l 
l<i< fe 


!.sJ 


(5.12) 


In the first line the constraints imply Vi that r* + Si = 1, whence either n = 1 and s* = 0, or = 0 and s» = 1. All 
the factorials trivialize and the remaining sum reduces to unity. Since p is an odd prime we now have 


g(L?'Lr-.-L% q ) = - 

1 

+ - 

P 


1 


p + 1 


E 


5{L^L ^ 2 ... L™ 2 " = W{X r ^Y Sq X r2 Y S2 ... X r rY s v)) 


i+ s i>°; E;Oi+Si)=p+i 

l<i<p 


ri!si!r 2 !s 2 !... r p \s p \ 


2p ^ (-i)fc-i 

f J lc 


E 


SiL™ 1 !,™ 2 ... L™ 2q = W{X rq Y s 'X r2 Y 32 ... X rk Y Sk )) 


k>q 


i + 8i> 0; Ei( r i+ S i)=p+ 1 

l<i<fc 


n!si!r 2 !s 2 !.. .r fc !s fc ! 


(5.13) 


In the second line, the constraints imply that for all but one of the i. (say for i ^ i*), we have + Si = 1, and that 
for exactly one of the i this quantity equals two, say + Sj„ = 2. Then the coefficient multiplying the 1/p in the 
second line above is 


E E 

i*= 1 r i + a i = l; r ij(c +r s * =2 

l<i<p 


S^L^'L™ 2 . ,.L/ 2q = W(X r 'Y s 'X r2 Y S2 . ..X r ”Y s >>)) 


(5.14) 


This collapses to 


p (l 1 'I 

E ^S(Li,Li.+i =X 2 ) +5 {UM,+i =XY) + -S(Li,Li.+i = ^ 2 )j- 


(5.15) 


Here, (with a minor change of notation), are the two letters at positions z* and z* + 1 when the word 

L™ 1 !™ 2 ... L™ 2q —» iii 2 ... T 2n is expanded out in full. This sum now collapses to 


\ [(2n - 1) - (2? - 1)] + q = ^P ~ q + ^ + q = 


(5.16) 


(Note that the final result for this sum is independent of whether the word begins with X and ends with Y, or begins 
with Y and ends with X.) So at this stage we have 


giL^'L™ 2 ... L™ 2q ) =-— + - + 1 

yy 1 2 2 q ) + i t 2 t 


2 p 


^ (-i)fc- 1 

1 i b 


E 


diLT'1% 2 ... L™ 2q = W{X r iY s 'X r iY s *... X rk Y Sk )) 


k>q 


i+Si> 0; Ei( r i+ s i)=P+ 1 

l<i<fc 


ri!si!r- 2 !s 2 !.. . rfc!s fe ! 


(5.17) 
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Consider the cases: 


• If g > 3, then certainly k > 3, and then each of the ri < p and s, < p. Then this last line is a sum of rational 
numbers with no factors of p in the denominator; when summed and reduced to lowest form this will be some 
rational number with no factors of p in the denominator. 


• If q = 2 then k > 2. The terms with k > 3 again have rt < p and Si < p. Among the terms with k = 2, only 
those with r* < p and Si < p contribute to the sum. Then this last line is again a sum of rational numbers with 
no factors of p in the denominator; when summed and reduced to lowest form this will be some rational number 
with no factors of p in the denominator. 


So for q > 2, (remembering JT rrii = 2n = p + 1), we have 


g(L?'L?'...L%') 


-(- (rational number with no p’s in denominator) ^ 0. 

2 P 


(5.18) 


For q = 1 the only possible contributions that have r* = p or s* = p, (and so have any chance at all of summing to 
zero), are those special cases that we knew were zero anyway, which were explicitly excluded by putting the “—4” 
in the bound. In brief, 4 ^2n < 2 2n ~ 1 — 4, and this bound is saturated whenever 2n = p + 1 with p an odd prime: 
# p+ 1 = 2 P — 4. Consequently limsup(#2n/2 2ra ) = 1/2. 


C. Comments 


This is as much as we have been able to do in terms of placing explicit bounds on and in terms of understanding 
the pattern of zeros of the Goldberg coefficients g{w). We fully expect that more could in principle be said. 

(For instance: It would be nice to know if ^ n /(2 n — 2) is locally minimum whenever n = p + 1? Or whether or not 
liminf(#„/2") = 1/2?) 


VI. SYMMETRIC BAKER—CAMPBELL—HAUSDORFF FORMULA 

Let us now consider the “symmetric” version of the Baker-Campbell -Hausdorff formula 

OO 

S{X, Y) = In (e x / 2 e r e- Y / 2 ) = ]T s n (X, Y). (6.1) 

n— 1 

Since (e' Y / 2 e r e' Y / 2 ) 1 = e~ x l 2 e' Jy e~ x ! 2 we have S(X,Y) = — S(—X, — Y), which implies that all the even-level 
terms vanish: S 2 n (X,Y) = 0. Killing off half the terms in the expansion is definitely a worthwhile simplification. To 
reconstruct the standard Baker-Campbell-Hausdorff formula one can, (using the notation LxY = [X, V]), adapt the 
usual Baker-Hausdorff lemma 


to see that 


°° t n °° r vnyl 

= «p(£*)r = £ yf y = £ ^ = [^ y ] ■ 

n =0 n=0 


e*e r = e x ' 2 {e x ' 2 e Y e~ x ' 2 ) e x ' 2 = e x ' 2 ( e (e x/2 ^“ x/2 )) e -^/ 2 = e x / 2 ^ML x/ 2 )y^ £ x/2 


( 6 . 2 ) 


(6.3) 


That is, the usual Baker-Campbell Hausdorff and symmetric Baker-Campbell Hausdorff formulae are related by 

Z(X, Y) = S(X, exp(L A72 )Y). (6.4) 

Let us compute (eg, using Maple or some equivalent package), the (N + 1) x (V + 1) matrix 


S N = In (exppGv/2) exp(Y)v) exp(AAr/2)). 


(6.5) 



17 


Again, the first row of the matrix S n is (essentially) the information we want. Specifically, noting that the matrix Sn 
is strictly upper triangular, let us denote 


' 0 

Si 

S2 

S3 

• • sn -i 

SN 

0 

0 

* 

* 

* 

* 

0 

0 

0 

* 

* 

* 

0 

0 

0 

0 

* 

* 

0 

0 

0 

0 

0 

* 

. 0 

0 

0 

0 

0 

0 _ 


Here the *’s again denote nonzero quantities that are not of specific interest for current purposes. It is now a trivial 


exercise to obtain the first few terms 

si=xi+yi\ (6.7) 

s 2 = 0; (6.8) 

S3 = ^(2*11/22/3 - XiX 2 y 3 ~ ^y\Xiy 3 - y\X 2 x 3 + 2y ± y 2 x 3 + 2x 1 y 2 x 3 )-, (6.9) 

s 4 =0. (6.10) 


Again, higher-order terms are easy to calculate, but tedious to display. Converting these low-order terms to words 
the key observations are that 


s 1 {X 1 Y) = X + Y- s 3 (X,Y) = ^(2XY 2 - X 2 Y -4YXY -YX 2 + 2Y 2 X + 2XYX). (6.11) 
Converting to commutators, we have 


s 3 (X,Y) = -±\(X + 2Y)XY\. (6.12) 

We could define “symmetric” versions of the Goldberg coefficients, g s (w ), and proceed with a fuller analysis along 
the lines above, but have not yet done so. We content ourselves (see Table II) with calculating for the symmetric 
Baker-Campbell Hausdorff expansion and comparing it with our explicit bound. Note n = 0, and that #f n+1 
again saturates whenever 2n + 1 = p is an odd prime. 


TABLE II. Number of non-zero Goldberg coefficients (symmetric BCH) 


n 


2 n - 2 

#£/(2"-2) 

#£/(2" - 2) 

3 

6 

6 

1 

1.0000 

5 

30 

30 

1 

1.0000 

7 

126 

126 

1 

1.0000 

9 

435 

510 

29/34 

0.8529 

11 

2046 

2046 

1 

1.0000 

13 

8190 

8190 

1 

1.0000 

15 

30846 

32766 

15423/16388 

0.9411 

17 

131070 

131070 

1 

1.0000 
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VII. SOME OTHER VARIANTS OF THE BAKER—CAMPBELL—HAUSDORFF FORMULA 

Now that we have seen the basic techniques employed, many other variants of the Baker-Campbcll Hausdorff 
formula can easily be extracted. For instance, consider the variants presented below. (We will not go beyond 4 th 
order in explicit formulae.) 


A. Loop Baker—Campbell—Hausdorff formula 

Consider the “loop” Baker-Campbell Hausdorff formula 

OO 

L(X, Y ) = In (e x e Y e- x e~ Y ) = ^ i n (X, Y). (7.1) 

n= 1 

This type of series is of interest, for instance, when considering parallel transport around a closed curve. Then 

h = 0; (7.2) 

4 = xiy 2 - y x x 2 \ (7-3) 

4 = ^ ( x x x 2 y 3 - 2 x x yix 3 - x x y 2 y 3 + y x x 2 x 3 + 2 y x x 2 y 3 - yiy 2 x 3 ); (7.4) 

4 = (2 xix 2 x 3 y 4 - 6 xix 2 y 3 x 4 - 3 x x x 2 y 3 y 4 + 6 x x y 2 x 3 x 4 + 6 x x y 2 x 3 y 4 + 2 x x y 2 y 3 y 4 

-2 yix 2 x 3 x 4 - 6 y\x 2 y 3 x 4 - 6 yix 2 y 3 y 4 + 3 y x y 2 x 3 x 4 + 6 y x y 2 x 3 y 4 - 2 y x y 2 y 3 x 4 ). (7.5) 

Notice the 4 term has quietly cancelled. The first few non-zero terms (in word form) become 

£ 2 {X, Y) = XY — YX\ (7.6) 

e 3 {X, Y) = i (X 2 Y - 2 XYX - XY 2 + YX 2 + 2 YXY - Y 2 X) ; (7.7) 

e 4 (X, Y) = ^ (2 X 3 Y - 6 X 2 YX - 3 X 2 Y 2 + 6 XYX 2 + 6 XYXY + 2 XY 3 

—2YX 3 - 6 YXYX - 6 YXY 2 + 3 Y 2 X 2 + 6 Y 2 XY - 2Y 3 X) . (7.8) 

When converting to commutators we make use of the results 

[X 2 Y - 2XYX - XY 2 + YX 2 + 2 YXY - Y 2 X] = [X 2 Y - 2XYX + 2 YXY - Y 2 X] 

= [(X + 2X + 2Y + Y)XY] = 3[(X + Y)XY\, (7.9) 


and 

[2 X 3 Y - 6 X 2 YX - 3 X 2 Y 2 + 6 XYX 2 + 6 XYXY + 2XY 3 
-2 YX 3 - 6 YXYX - 6 YXY 2 + 3F 2 A 2 + 6 Y 2 XY - 2Y 3 X] 

= [2 X 3 Y - 6 X 2 YX + 6 XYXY - 6 YXYX + 6 Y 2 XY - 2Y 3 X] 

= [(2.X 2 + 6 X 2 + 6 XY + 6 YX + 6 Y 2 + 2 Y 2 )XY] 

= [(8A 2 + 6 XY + 6 YX + 8V 2 )W] 

= 4[(2X 2 +3XY+ 2Y 2 )XY], (7.10) 

to deduce 

e 2 (X,Y) = [XY]; 
e 3 (X,Y) = ±[(X + Y)XY]; 

h(X,Y) = — [(2X 2 + 3XY + 2Y 2 )XY}. 


Higher-order terms are (at least in principle) straightforward. 


(7.11) 

(7.12) 

(7.13) 
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B. Triangular Baker—Campbell Hausdorff formula 

Consider the “triangular” Baker-Campbell Hausdorff formula 

OO 

T(X, Y) = In (e- x e^ x+Y ^e- Y ) = t n (X, Y). 


n— 1 


Then 


1 1 = 0; 


t2 = (xiy 2 - yix 2 ); 

h = \ {x 4 x 2 y 3 - 2 x 4 y 2 x 3 + x 4 y 2 y 3 + y\x 2 x 3 - 2 y 4 x 2 y 3 + 2 / 12 / 2 X 3 ); 

6 

t 4 = — {-x 4 x 2 x 3 y 4 + 3x 4 x 2 y 3 x 4 - x 4 x 2 y 3 y 4 - 3x 4 y 2 x 3 x 4 + 2x 4 y 2 x 3 y 4 - x 4 y 2 y 3 y 4 
+yix 2 x 3 x 4 -2y 4 x 2 y 3 x 4 + 3y 4 x 2 y 3 y 4 + y 4 y 2 x 3 x 4 - 3y 4 y 2 x 3 y 4 + 2 / 12 / 22 / 3 x 4 ). 
Notice the t 4 term has again quietly cancelled. The first few non-zero terms (in word form) become 

t 2 {X,Y) = ~(XY-YX) ] 

t 3 (X, Y) = i ( X 2 Y - 2XYX + XY 2 + YX 2 - 2YXY + Y 2 X ); 
t 4 (X, Y) = ^ ( -X 3 Y + 3 X 2 YX - X 2 Y 2 - 3 XYX 2 + 2XYXY - XY 3 


+YX 3 - 2YXYX + 3 YXY 2 + Y 2 X 2 - 3 Y 2 XY + Y 3 X) . 


Converting to commutators we obtain 


t 2 (X,Y) = --[XY}- 

t 3 (X,Y) = ^[(X-Y)XY]- 

U(X,Y) = ^[(-X 2 + XY — F 2 )XF], 


(7.14) 

(7.15) 

(7.16) 

(7.17) 

(7.18) 

(7.19) 

(7.20) 

(7.21) 

(7.22) 

(7.23) 

(7.24) 


Higher-order terms are in principle straightforward. 


C. Sum and difference Baker—Campbell—Hausdorff formula 

Consider the “sum and difference” Baker-Campbell Hausdorff formula 

OO 

SD(X,Y) = In ( e {x+Y) e (x ~ Y) ^j = sd n (X,Y). 


The first few terms are 


sdi = 2 x 4 ] 

sd 2 = -{x 4 y 2 - 2 /iX 2 ); 

sd 3 = ^ ( x 4 y 2 y 3 - 2y 4 x 2 x 3 + 2 / 12 / 2 X 3 ); 

sd 4 = — ( x 4 x 2 x 3 y 4 - 3xix 2 j/ 3 x 4 + 3 x 4 y 2 x 3 x 4 - x 1 y 2 y 3 y 4 

-yix 2 x 3 x 4 + 3 2/1X22/32/4 - 3 yiy 2 x 3 y 4 + y 4 y 2 y 3 x 4 ). 


(7.25) 

(7.26) 

(7.27) 

(7.28) 

(7.29) 
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These terms (in word form) become 

sd 1 {X,Y) = 2X; (7.30) 

sd 2 (X,Y) = -(XY-YXy, (7.31) 

sd 3 {X , Y) = \ (XY 2 - TYX 2 + Y 2 X) ; (7.32) 

sd 4 (X , y) = (X 3 Y - 3 X 2 YX + 3 XYX 2 - XY 3 - YX 3 + 3 YXY 2 - 3 Y 2 XY + Y 3 X ). (7.33) 

In commutator form this yields 

sdiiX.Y) = 2X] (7.34) 

sd 2 (X,Y) = -[XY]- (7.35) 

sd 3 (X,Y) = ^[Y 2 X}-, (7.36) 

sd 4 (X, Y) = ^[(X 2 - Y 2 )XY]. (7.37) 

Higher-order terms are in principle straightforward. 


D. Highly-symmetrized Baker—Campbell—Hausdorff formula 


Consider the “highly-symmetrized” Baker-Campbcll Hausdorff formula 

OO 

SS(X, Y) = In ^ e -(X+Y)/2 e X/2 e Y e X/2 e -(X + Y)/2^ = £ y) (7 . 38) 

n— 1 

This particular object has been carefully constructed to make many of the low-order terms vanish. Specifically all 
the ss 2 n = 0, and the ssi term has again quietly cancelled. The first few terms are 

ssi = 0; (7.39) 

ss 2 = 0; (7.40) 

SS 3 = {~xix 2 y 3 + 2 x x y 2 x 3 + 2 x x y 2 y 3 - y x x 2 x 3 - 4 y x x 2 y 3 + 2 y x y 2 x 3 ); (7.41) 

ss 4 = 0. (7.42) 

Among the first four terms (in word form) the only no-zero one is 

ss 3 (X, Y) = ^ ( -X 2 Y + 2 XYX + 2 XY 2 - YX 2 - 4 YXY + 2 Y 2 X) . (7.43) 

In commutator form this becomes 

SS3 (X, Y) = ~ [(2X - 3 Y)XY] . (7.44) 

Higher-order terms are in principle straightforward. 


E. Symmetric sum and difference Baker—Campbell—Hausdorff formula 


Consider the “symmetric sum and difference” Baker-Campbell-Hausdorff formula 

OO 

SSD(X, Y) = In (< s (x-Y)/2 e (x+Y) e (x-Y)/2 ^ = ssd n (X, Y). (7.45) 

n =1 

The first few terms are 

ssdi = 2x x ; (7.46) 

ssd 2 = 0; (7.47) 

ssd 3 = (-3 x x x 2 y 3 + 6 x x y 2 x 3 + x x y 2 y 3 - 3 y x x 2 x 3 - 2 y x x 2 y 3 + yiy 2 x 3 ) ; (7.48) 

ssd 4 = 0. (7.49) 
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Among the first four terms (in word form) the only no-zero ones are 

ssdi(X, Y) = 2X- ssd 3 {X , Y) = -L (-3 X 2 Y + 6 XYX + XY 2 - 3 YX 2 - 2 YXY + Y 2 X ). (7.50) 

Then in commutator form 

ssd 3 (X,Y) = -h(X + Y)XY}. (7.51) 

o 

Higher-order terms are in principle straightforward. 


F. Highly-symmetrized sum and difference Baker—Campbell Hausdorff formula 

Consider the “highly-symmetrized sum and difference” Baker-Campbell Hausdorff formula 

OO 

SSSD(X, Y) = In (e- Y e (x - Y)/2 e (x+y) e ( ' Y - y)/2 e- x ) = ^ sssd n (X, Y). (7.52) 

71 — 1 

The first few terms are 

sssdi = 0; (7.53) 

sssd 2 = 0; (7.54) 

sssd 3 = ^ (-3 x\x 2 y 3 + 6 xiy 2 x 3 + x 3 y 2 y 3 - 3 y\x 2 x 3 - 2 y 3 x 2 y 3 + yiy 2 x 3 ) ; (7.55) 

sssd^ = 0- (7.56) 

(Note that only the first term changes compared to the previous “symmetric sum and difference” result.) Among the 
first four terms (in word form) the only no-zero one is 

sssd 3 (X, Y) = (-3 X 2 Y + 6A 'YX + XY 2 - 3 YX 2 - 2YXY + Y 2 X) . (7.57) 

Then in commutator form 

as8d*{X,Y) = -h i {X + Y)XY\. (7.58) 

Higher-order terms are in principle straightforward. 


VIII. DISCUSSION 

We have provided a simplified version of the Reinsch algorithm for the Baker-Campbcll-Hausdorff expansion and, 
(using completely standard “off the shelf” software), used it to investigate the properties of the Goldberg coefficients. 
Some suggestive patterns and bounds, observationally motivated by inspecting the low-order and medium-high-order 
terms in the expansion were then verified by analytic proof. We have also considered various variant forms of the 
Baker -Campbell Hausdorff expansion, and inspected the low-order terms. We would argue that the main virtue of 
the simplified version of the Reinsch algorithm lies in the fact that it makes it almost trivial to carry out low-order 
“symbolic experiments”. 
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Appendix A: Some Maple code 

This fragment of Maple code will calculate the z n . (For definiteness, the first 9 of the z n ). 

restart: 

with(LinearAlgebra): 

N: =9: 

X:=Matrix(N+l,N+l): 

Y:=Matrix(N+l,N+1): 

for i from 1 to N do: X[i,i+1]:=x [i]: od: 
for i from 1 to N do: Y [i,i+1] :=y [i] : od: 

EX:=MatrixExponential(X): 

EY:=MatrixExponential(Y): 

EXY: =EX .EY: 

Z:=MatrixFunction(EXY, ln(x), x): 

for n from 1 to N do: z[n]:= Z[l,n+1]; od: 

The most time consuming part of the process is writing the results to the screen. 

for n from 1 to N do: z[n]; od; 

The first four terms (zi, Z 2 , 23 , 24 ), and some of the medium-low-order terms ( 25 , Zq , 27 , and 2s) are presented in 
full above. Modern laptops can easily calculate out to n = 13, and with a little more difficulty out to n = 17. The 
relevance of this Maple code is its extreme simplicity, using completely “off the shelf” software — we specifically do 
not make any claims regarding efficiency. 

To count the number of non-zero Goldberg coefficients on words of length ro, and compare it with the maximum 
possible value of 2 n — 2 , one can proceed as follows: 

for n from 2 to N do: print("n =", n, "#_n =", nops(z[n]), "2~n-2 =", 2~n-2 ); od; 

By not explicitly printing the z n to screen, one saves a lot of time, and the calculation can be carried out to somewhat 
higher order. 

This code, or minor variants thereof, is sufficient for one to be able to easily check the first few terms in this or 
closely related computations. For instance, the changes required to calculate the s n appearing in symmetric object 
S(X,Y ) = In (e x /‘ 2 e Y e x ! T ) are straightforward. 

EX2:=MatrixExponential(X/2): 

EY:=MatrixExponential(Y): 

EXY2 : =EX2 .EY. EX2 : 

S : =MatrixFunction(EXY2, ln(x), x): 

for n from 1 to N do: s[n]:= S[l,n+1]; od: 

Again, the most time consuming part of the process is writing the results to the screen. 

for n from 1 to N do: s[n]; od; 

Further variations on this theme are straightforward, and the relevant code will not be reproduced here. 
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